Jianfu Zhang

DirectTryOn: One-Step Virtual Try-On via Straightened Conditional Transport

May 13, 2026

Enhancing Domain Generalization in 3D Human Pose Estimation through Controllable Generative Augmentation

May 12, 2026

Any3DAvatar: Fast and High-Quality Full-Head 3D Avatar Reconstruction from Single Portrait Image

Apr 15, 2026

Towards Source-Aware Object Swapping with Initial Noise Perturbation

Mar 02, 2026

VTONGuard: Automatic Detection and Authentication of AI-Generated Virtual Try-On Content

Jan 20, 2026

Interpretable and Reliable Detection of AI-Generated Images via Grounded Reasoning in MLLMs

Jun 08, 2025

COCO-Inpaint: A Benchmark for Image Inpainting Detection and Manipulation Localization

Apr 25, 2025

Towards Explainable Fake Image Detection with Multi-Modal Large Language Models

Apr 19, 2025

InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

Apr 15, 2025

High-Quality 3D Head Reconstruction from Any Single Portrait Image

Mar 11, 2025